Mining Interesting Trivia for Entities from Wikipedia
نویسنده
چکیده
TRIVIA is any fact about an entity, which is interesting due to any of the following characteristics − unusualness, uniqueness, unexpectedness or weirdness. Such interesting facts are provided in Did You Know? section at many places. Although trivia are facts of little importance to be known, but we have presented their usage in user engagement purpose. Such fun facts generally spark intrigue and draws user to engage more with the entity, thereby promoting repeated engagement. The thesis has cited some case studies, which show the significant impact of using trivia for increasing user engagement or for wide publicity of the product/service. In this thesis, we propose a novel approach for mining entity trivia from their Wikipedia pages. Given an entity, our system extracts relevant sentences from its Wikipedia page and produces a list of sentences ranked based on their interestingness as trivia. At the heart of our system lies an interestingness ranker which learns the notion of interestingness, through a rich set of domain-independent linguistic and entity based features. Our ranking model is trained by leveraging existing user-generated trivia data available on the Web instead of creating new labeled data for movie domain. For other domains like sports, celebrities, countries etc. labeled data would have to be created as described in thesis. We evaluated our system on movies domain and celebrity domain, and observed that the system performs significantly better than the defined baselines. A thorough qualitative analysis of the results revealed that our engineered rich set of features indeed help in surfacing interesting trivia in the top ranks.
منابع مشابه
Did You Know? - Mining Interesting Trivia for Entities from Wikipedia
Trivia is any fact about an entity which is interesting due to its unusualness, uniqueness, unexpectedness or weirdness. In this paper, we propose a novel approach for mining entity trivia from their Wikipedia pages. Given an entity, our system extracts relevant sentences from its Wikipedia page and produces a list of sentences ranked based on their interestingness as trivia. At the heart of ou...
متن کاملThe Unusual Suspects: Deep Learning Based Mining of Interesting Entity Trivia from Knowledge Graphs
Trivia is any fact about an entity which is interesting due to its unusualness, uniqueness or unexpectedness. Trivia could be successfully employed to promote user engagement in various product experiences featuring the given entity. A Knowledge Graph (KG) is a semantic network which encodes various facts about entities and their relationships. In this paper, we propose a novel approach called ...
متن کاملTrivia Mining from Knowledge Graphs
Trivia is any fact about an entity which is interesting due to its unusualness, uniqueness or unexpectedness. Trivia could be successfully employed to promote user engagement in various product experiences featuring the given entity. A Knowledge Graph (KG) is a semantic network which encodes various facts about entities and their relationships. We propose a novel approach called DBpedia Trivia ...
متن کاملThe Association Rule Mining System for Acquiring Knowledge of DBpedia from Wikipedia Categories
Wikipedia categories are a useful source of knowledge that is usually expressed in a noun-phrase that contains information about concepts of entities or relations among entities. In DBpedia KBs, they categorize their entities into Wikipedia categories using RDF triples. The RDF triples represent only categories of entities, but not concepts of entities or relations among entities despite the fa...
متن کاملRule Mining for Semantifying Wikilinks
Wikipedia-centric Knowledge Bases (KBs) such as YAGO and DBpedia store the hyperlinks between articles in Wikipedia using wikilink relations. While wikilinks are signals of semantic connection between entities, the meaning of such connection is most of the times unknown to KBs, e.g., for 89% of wikilinks in DBpedia no other relation between the entities is known. The task of discovering the exa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1510.03025 شماره
صفحات -
تاریخ انتشار 2015